Opinion Summarization for Hotel Reviews

Authors

  • Bogdan Marchis
  • Alexandru Tifrea
  • Mihai Volmer
  • Traian Rebedea
Abstract

This paper presents a new approach for finding the n-grams that best summarize a large set of reviews. The proposed unsupervised method uses a readability score and a representativeness score to select the n-grams that best convey the main opinions contained in the processed reviews. To further refine the selection, we use sentiment analysis and part-of-speech (POS) tagging to impose requirements that candidate n-grams must meet. Furthermore, the best n-grams are classified into several topics, which helps prevent redundancy among the summarizing n-grams. We thus offer an unsupervised, mostly non-aspect-based, unstructured opinion summarization algorithm that is generic enough to be implemented easily on any web platform that accepts reviews. To assess the results of our algorithm, we summarized hotel reviews extracted from the TripAdvisor website. The algorithm produces readable results that convey relevant opinions about the hotels used for testing.
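
The abstract does not define the two scores, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes representativeness is the fraction of reviews containing an n-gram, uses a toy readability proxy that penalizes n-grams starting or ending with a stopword, and approximates redundancy prevention by skipping candidates that share a word with an already selected n-gram. The names `STOPWORDS`, `representativeness`, `readability`, and `summarize` are hypothetical, and the sentiment, POS, and topic steps are omitted.

```python
from collections import Counter

# Assumption: a tiny hand-picked stopword list, for illustration only.
STOPWORDS = {"the", "a", "an", "was", "is", "and", "of", "to", "in", "very", "were", "it"}


def ngrams(tokens, n):
    """Return all contiguous n-grams of a token list as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def representativeness(reviews_tokens, n):
    """Assumed proxy: fraction of reviews that contain each n-gram."""
    counts = Counter()
    for tokens in reviews_tokens:
        counts.update(set(ngrams(tokens, n)))  # count each review once
    total = len(reviews_tokens)
    return {gram: c / total for gram, c in counts.items()}


def readability(gram):
    """Toy proxy: lose 0.5 for each edge word (first, last) that is a stopword."""
    return 1.0 - 0.5 * sum(w in STOPWORDS for w in (gram[0], gram[-1]))


def summarize(reviews, n=2, k=3):
    """Greedily pick up to k high-scoring n-grams with no shared words."""
    reviews_tokens = [r.lower().split() for r in reviews]
    rep = representativeness(reviews_tokens, n)
    score = {g: rep[g] * readability(g) for g in rep}
    # Sort by descending score; break ties alphabetically for determinism.
    ranked = sorted(score, key=lambda g: (-score[g], g))
    selected, used_words = [], set()
    for gram in ranked:
        if len(selected) == k:
            break
        if score[gram] <= 0 or used_words & set(gram):
            continue  # unreadable, or redundant with an earlier pick
        selected.append(gram)
        used_words.update(gram)
    return [" ".join(g) for g in selected]


reviews = [
    "great location friendly staff",
    "friendly staff clean room",
    "clean room great location",
    "the staff was rude",
]
summary = summarize(reviews, n=2, k=3)
print(summary)
```

A real system would replace the readability proxy with something like a language-model score and add the sentiment and POS filters the abstract mentions.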

Similar resources

Decision Making through Polarized Summarization of User Reviews

When buying a mobile phone, booking a hotel, or watching a movie, many people rely on the reviews available on the Web. However, this huge number of opinions makes it difficult for users to form a comprehensive view of the crowd's judgments and to make an optimal decision. In this work we provide evidence that automatic text summarization of reviews can be used to design Web applications able t...

Classifying Hotel Reviews into Criteria for Review Summarization

User reviews are now available on shopping and hotel reservation sites. However, with the exponential growth of information on the Internet, it is becoming increasingly difficult for a user to read and understand all the material in a large set of reviews. In this paper, we propose a method for classifying hotel reviews written in Japanese into criteria, e.g., location and faciliti...

A Review Corpus for Argumentation Analysis

The analysis of user reviews has become critical in research and industry, as user reviews increasingly impact the reputation of products and services. Many review texts comprise an involved argumentation with facts and opinions on different product features or aspects. Therefore, classifying sentiment polarity does not suffice to capture a review’s impact. We claim that an argumentation analys...

The Pareto Principle Is Everywhere: Finding Informative Sentences for Opinion Summarization Through Leader Detection

Most previous work on opinion summarization focuses on summarizing the sentiment polarity distribution towards different aspects of an entity (e.g., battery life and screen of a mobile phone). However, users may want more than this kind of opinion summarization. Besides such coarse-grained summarization on aspects, one may prefer to read detailed but concise text of the opinion data for more...

Low-Quality Product Review Detection in Opinion Summarization

Product reviews posted at online shopping sites vary greatly in quality. This paper addresses the problem of detecting low-quality product reviews. Three types of biases in the existing evaluation standard of product reviews are discovered. To assess the quality of product reviews, a set of specifications for judging the quality of reviews is first defined. A classification-based approach is prop...

Publication date: 2015